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Abstract: In this paper, we obtain general representations for the joint dis- 
tributions and copulas of arbitrary dependent random variables absolutely 
continuous with respect to the product of given one-dimensional marginal dis- 
tributions. The characterizations obtained in the paper represent joint distribu- 
tions of dependent random variables and their copulas as sums of [/-statistics 
in independent random variables. We show that similar results also hold for 
expectations of arbitrary statistics in dependent random variables. As a corol- 
lary of the results, we obtain new representations for multivariate divergence 
measures as well as complete characterizations of important classes of depen- 
dent random variables that give, in particular, methods for constructing new 
copulas and modeling different dependence structures. 

The results obtained in the paper provide a device for reducing the analysis 
of convergence in distribution of a sum of a double array of dependent random 
variables to the study of weak convergence for a double array of their inde- 
pendent copies. Weak convergence in the dependent case is implied by similar 
asymptotic results under independence together with convergence to zero of 
one of a series of dependence measures including the multivariate extension 
of Pearson's correlation, the relative entropy or other multivariate divergence 
measures. A closely related result involves conditions for convergence in dis- 
tribution of m-dimensional statistics h{Xt, Xt+i, ■ ■ ■ , Xt+m—i) of time series 
{Xt} in terms of weak convergence of h{(^t,£,t+i, ■ ■ ■ , ?t+m-l), where {^t} is a 
sequence of independent copies of X'^s, and convergence to zero of measures of 
intertemporal dependence in {Xt } . The tools used include new sharp estimates 
for the distance between the distribution function of an arbitrary statistic in 
dependent random variables and the distribution function of the statistic in 
independent copies of the random variables in terms of the measures of depen- 
dence of the random variables. Furthermore, we obtain new sharp complete 
decoupling moment and probability inequalities for dependent random vari- 
ables in terms of their dependence characteristics. 
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1. Introduction 

In recent years, a number of studies in statistics, economics, finance and risk man- 
agement have focused on dependence measuring and modeling and testing for serial 
dependence in time series. It was observed in several studies that the use of the most 
widely applied dependence measure, the correlation, is problematic in many setups. 
For example, Boyer, Gibson and Lorctan [9] reported that correlations can provide 
little information about the underlying dependence structure in the cases of asym- 
metric dependence. Naturally (see, e.g., Blyth [7] and Shaw [71]), the linear corre- 
lation fails to capture nonlinear dependencies in data on risk factors. Embrechts, 
McNeil and Straumann [22] presented a rigorous study concerning the problems 
related to the use of correlation as measure of dependence in risk management and 
finance. As discussed in [22] (see also Hu [32]), one of the cases when the use of 
correlation as measure of dependence becomes problematic is the departure from 
multivariate normal and, more generally, elliptic distributions. As reported by Shaw 
[71], Ang and Chen [4] and Longin and Solnik [54], the departure from Gaussianity 
and elliptical distributions occurs in real world risks and financial market data. 
Some of the other problems with using correlation is that it is a bivariate measure 
of dependence and even using its time varying versions, at best, leads to only cap- 
turing the pairwise dependence in data sets, failing to measure more complicated 
dependence structures. In fact, the same applies to other bivariate measures of de- 
pendence such as the bivariate Pearson coefficient, KuUback-Leibler and Shannon 
mutual information, or Kendall's tau. Also, the correlation is defined only in the 
case of data with finite second moments and its reliable estimation is problematic 
in the case of infinite higher moments. However, as reported in a number of stud- 
ies (see, e.g., the discussion in Loretan and Phillips [55], Cont [11] and Ibragimov 
[33, 34] and references therein), many financial and commodity market data sets 
exhibit heavy-tailed behavior with higher moments failing to exist and even vari- 
ances being infinite for certain time series in finance and economics. A number of 
frameworks have been proposed to model heavy-tailedncss phenomena, including 
stable distributions and their truncated versions, Pareto distributions, multivari- 
ate t-distributions, mixtures of normals, power exponential distributions, ARCH 
processes, mixed diffusion jump processes, variance gamma and normal inverse 
Gamma distributions (see [11, 33, 34] and references therein), with several recent 
studies suggesting modeling a number of financial time series using distributions 
with "semiheavy tails" having an exponential decline (e.g., Barndorff Nielsen and 
Shephard [5] and references therein). The debate concerning the values of the tail 
indices for different heavy-tailed financial data and on appropriateness of their mod- 
eling based on certain above distributions is, however, still under way in empirical 
literature. In particular, as discussed in [33, 34], a number of studies continue to 
find tail parameters less than two in different financial data sets and also argue that 
stable distributions are appropriate for their modeling. 

Several approaches have been proposed recently to deal with the above problems. 
For example, Joe [42, 43] proposed multivariate extensions of Pearson's coefficient 
and the KuUback Leibler and Shannon mutual information. A number of papers 
have focused on statistical and econometric applications of mutual information and 
other dependence measures and concepts (see, among others, Lehmann [52], Golan 
[26], Golan and Perloff [27], Massoumi and Racine [57], Miller and Liu [58], Soofi 
and Retzer [73] and UUah [76] and references therein). Several recent papers in 
econometrics (e.g., Robinson [66], Granger and Lin [29] and Hong and White [31]) 
considered problems of estimating entropy measures of serial dependence in time 
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series. In a study of multifractals and generalizations of Boltzmann-Gibbs statistics, 
Tsallis [75] proposed a class of generalized entropy measures that include, as a par- 
ticular case, the Hellinger distance and the mutual information measure. The latter 
measures were used by Fernandes and Flores [24] in testing for conditional indepen- 
dence and noncausality. Another approach, which is also becoming more and more 
popular in econometrics and dependence modeling in finance and risk management 
is the one based on copulas. Copulas are functions that allow one, by a celebrated 
theorem due to Sklar [72], to represent a joint distribution of random variables 
(r.v.'s) as a function of marginal distributions (see Section 3 for the formulation of 
the theorem). Copulas, therefore, capture all the dependence properties of the data 
generating process. In recent years, copulas and related concepts in dependence 
modeling and measuring have been applied to a wide range of problems in eco- 
nomics, finance and risk management (e.g., Taylor [74], Fackler [23], Frees, Carriere 
and Valdez [25], Klugman and Parsa [46], Patton [61, 62], Richardson, Klose and 
Gray [65], Embrcchts, Lindskog and McNeil [21], Hu [32], Rciss and Thomas [64], 
Granger, Terasvirta and Patton [30] and Miller and Liu [58]). Patton [61] studied 
modeling time-varying dependence in financial markets using the concept of con- 
ditional copula. Patton [62] applied copulas to model asymmetric dependence in 
the joint distribution of stock returns. Hu [32] used copulas to study the structure 
of dependence across financial markets. Miller and Liu [58] proposed methods for 
rc;covc!ry of multivariate joint distributions and copulas from limited information 
using entropy and other information theoretic concepts. 

The multivariate measures of dependence and the copula-based approaches to 
dependence modeling are two interrelated parts of the study of joint distributions 
of r.v.'s in mathematical statistics and probability theory. A problem of fundamen- 
tal importance in the field is to determine a relationship between a multivariate 
cumulative distribution function (cdf) and its lower dimensional margins and to 
measure degrees of dependence that correspond to particular classes of joint cdf 's. 
The problem is closely related to the problem of characterizing the joint distribu- 
tion by conditional distributions (see Gouricroux and Monfort [28]). Remarkable 
advances have been made in the latter research area in recent years in statistics 
and probability literature (see, e.g., papers in Dall'Aglio, Kotz and Salinetti [13], 
Benes and Stepan [6] and the monographs by Joe [44], Nelsen [60] and Mari and 
Kotz [56]). 

Motivated by the recent surge in the interest in the study and application of de- 
pendence measures and related concepts to account for the complexity in problems 
in statistics, economics, finance and risk management, this paper provides the first 
characterizations of joint distributions and copulas for multivariate vectors. These 
characterizations represent joint distributions of dependent r.v.'s and their copulas 
as sums of ^/-statistics in independent r.v.'s. We use these characterizations to in- 
troduce a unified approach to modeling multivariate dependence and provide new 
results concerning convergence of multidimensional statistics of time series. The re- 
sults provide a device for reducing the analysis of convergence of multidimensional 
statistics of time series to the study of convergence of the measures of intertemporal 
dependence in the time series (e.g., the multivariate Pearson coefficient, the relative 
entropy, the multivariate divergence measures, the mean information for discrimi- 
nation between the dependence and independence, the generalized Tsallis entropy 
and the Hellinger distance). Furthermore, they allow one to reduce the problems of 
the study of convergence of statistics of intertemporally dependent time series to 
the study of convergence of corresponding statistics in the case of intertemporally 
independent time series. That is, the characterizations for copulas obtained in the 
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paper imply results which associate with each set of arbitrarily dependent r.v.'s a 
sum of {/-statistics in independent r.v.'s with canonical kernels. Thus, they allow 
one to reduce problems for dependent r.v.'s to well-studied objects and to transfer 
results known for independent r.v.'s and [/-statistics to the case of arbitrary depen- 
dence (see, e.g., Ibragimov and Sharakhmetov [36-40], Ibragimov, Sharakhmetov 
and Cecen [41], de la Pena, Ibragimov and Sharakhmetov [16, 17] and references 
therein for general moment inequalities for sums of [/-statistics and their particular 
important cases, sums of r.v.'s and multilinear forms, and Ibragimov and Phillips 
[35] for a new and c;onccptually simple method for obtaining weak convergence of 
multilinear forms, [/-statistics and their non-linear analogues to stochastic integrals 
based on general asymptotic theory for semimartingales and for applications of the 
method in a wide range of linear and non- linear time series models). 

As a corollary of the results for copulas, we obtain new complete characteriza- 
tions of important classes of dependent r.v.'s that give, in particular, methods for 
constructing new copulas and modeling various dependence structures. The resiilts 
in the paper provide, among others, complete positive answers to the problems 
raised by Kotz and Seeger [47] concerning characterizations of density weighting 
functions (d.w.f.) of dependent r.v.'s, existence of methods for constructing d.w.f.'s, 
and derivation of d.w.f.'s for a given model of dependence (see also [58] for a dis- 
cussion of d.w.f.'s). 

Along the way, a general methodology (of intrinsic interest within and outside 
probability theory, economics and finance) is developed for analyzing key measures 
of dependence among r.v.'s. Using the methodology, we obtain sharp decoupling 
inequalities for comparing the expectations of arbitrary (intcgrabk^) functions of 
dependent variables to their corresponding counterparts with independent variables 
through the inclusion of multivariate dependence measures. 

On the methodological side, the paper shows how the results in theory of 
[/-statistics, including inversion formulas for these objects that provide the main 
tools for the argument for representations in this paper (see the proof of Theorem 1), 
can be used in the study of joint distributions, copulas and dependence. 

The paper is organized as follows. Sections 2 and 3 contain the results on general 
characterizations of copulas and joint distributions of dependent r.v.'s. Section 4 
presents the results on characterizations of dependence based on [/-statistics in in- 
dependent r.v.'s. In Sections 5 and 6, we apply the results for copulas and joint 
distributions to characterize different classes of dependent r.v.'s. Section 7 contains 
the results on reduction of the analysis of convergence of multidimensional statis- 
tics of time series to the study of convergence of the measures of intertemporal 
dependence in time series as well as the results on sharp decoupling inequalities 
for dependent r.v.'s. The proofs of the results obtained in the paper are in the 
Appendix. 

2. General characterizations of joint distributions of arbitrarily 
dependent random Vciriables 

In the present section, we obtain explicit general representations for joint distribu- 
tions of arbitrarily dependent r.v.'s absolutely continuous with respect to products 
of marginal distributions. Let F/. : R — » [0, 1], fc = 1, . . . , n, be one-dimensional 
cdf's and let ^i, . . . ,^ri be independent r.v.'s on some probability space (17, 3, P) 
with P{^k £ Xk) = Fk{xk), x/c € R, fc = 1, . . . , n (we formulate the results for 
the case of right-continuous cdf's; however, completely similar results hold in the 
left-continuous case). 
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In what follows, F{xi, . . . , Xn), Xi € R, i = 1, . . . ,n, stands for a function satis- 
fying the following conditions: 

(a) F{xi, . . . ,Xn) = P{Xi < xi,. . . ,Xn < Xn) for some r.v.'s Xi, . . . ,X„ on a 
probability space S, P); 

(b) the one-dimensional marginal cdf 's of F are Fi , . . . , F„ ; 

(c) F is absolutely continuous with respect to dF{xi) ■ ■ ■dFn{xn) in the sense 
that there exists a Borel function G : R" [0, oo) such that 

F{xi,...,Xn)= f ■■■ I G{tl,...,tn)dFiiti)---dFn{tn). 
J —oo J — oo 

As usual, throughout the paper, we denote G in (c) by -jp^r^jp^- addi- 
tion, F{xj^,...,Xj^), 1 < ji < ••■ < jk < n, k = 2,...,n, stands for the 
fc-dimensional marginal cdf of F{xi, . . . ,a;„). Also, in what follows, if not stated 

dF(x ■ X ■ ) 

otherwise, .1,"'' ,'' , 1 < ji < • • • < jfe < n, = 2, . . . ,n, is to be taken to be 1 
if at least one Xj^,. . ., Xj^ is not a point of increase of the corresponding Fj^ , Fj^. 
(that is, if [xj-^ , . . . ,Xj^) is not in the support of dFj^ ■ ■ ■ dFj^). 

Throughout the paper, the functions g appearing in the representations obtained 

are assumed to be Borel measurable. 

Theorem 2.1. A function F : R" [0,1] is a joint cdf with one- dimensional 
marginal cdf's Fk{xk), Xk G R, k = l,...,n, absolutely continuous with respect 
to the product of marginal cdf's 11^=1 Pk{xk), if and only if there exist functions 
.9ji,...,v : R'' ^ R, I < ii < ■ ■ ■ < ic < n, c = 2, . . . ,n, satisfying conditions 

Al (integrability): 

-K|5ii,...,ictei.---.6c)l < oo> 

A2 (degeneracy); 

^{9ii,---,ic (Cii ) • • • ) ^ik-i > ^ik ' ^ik+1 ) • • • ) ) ICii ) • • • ) ^ik-i > ) • • • ) Cic) ~ 



5ii,...,ic (Cii 7 • ■ • ! S,ik-i ! -^ifc ) Cifc+i ) • • • ) iic)^-^ik {"^ik ) — 0' (ci.S.) 

> 

1 < ii < ■ ■ ■ < ic < n, k = 1,2, . . . ,c, c = 2, . . . , n, 
A3 (positive definiteness) ; 

n 

c=2 l<ii<---<ic<n 

and such that the following representation holds for F : 

■■■ {l + Un{tl,...,tn))J{dFi{ti). 

OO J — oo i=\ 

Moreover, gi^,...,iSiii, ■ ■ • >Cic) = /ii,...,ic(Ciu • • • )Cic) (as-), 1 < ii < ■ ■ ■ < ic < n, 
c = 2, . . . ,n, where 



fe=2 l<ji<-<jke{ii,..;ic} 
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Remark 2.1. It is not difficult to see that if r.v.'s Xi, . . . ,Xn have a joint cdf 
given by (|2.ip tlien tlie r.v.'s Xj^ , . . . , Xj^^ , 1 < ji < ■ ■ ■ < jk < n, k = 2, . . . ,n, 
fiave the joint cdf 

k 
i=l 

witli the same functions gij^,...,i^, and where Bk = {ji, . . . ,jk}- 

Theorem 12.11 can be equivalently formulated as in the following remark. 

Remark 2.2. A function F : R" [0, 1] is a joint cdf with the one-dimensional 
marginal cdf's Fk{xk), Xfe G R, /c = 1, . . . , n, absolutely continuous with respect 
to the product of marginal cdf's 0^=1 Pk{xk), if and only if there exist functions 
■ R-'^ R, 1 < ii < ■ ■ ■ < ic < n, c — 2, . . . ,n, satisfying conditions A1-A3 
and such that the element of frequency dF{xi, . . . ,Xn) can be expressed in the form 

n 

(2.2) dF{xi, . . . ,Xn) =Y[dF,{x^){l + Un{xi, . . . ,Xn)). 

i=l 

Remark 2.3. Sharakhmetov [69] provided proof of (|2.2p in the case of density 
functions (of r.v.'s absolutely continuous with respect to Lebesgue measure), with 
a mention that a similar representation holds for distributions of discrete r.v.'s. 
The setup considered in this paper includes (among others) the class of vectors of 
dependent absolutely continuous and discrete r.v.'s as well as vectors of mixtures 
of absolutely continuous and discrete r.v.'s. Furthermore, our proof easily extends 
to the case of general Banach spaces, in particular, the spaces R*^. 

3. Applications to copulas 

Let us start with the definition of copulas and the formulation of Sklar's theorem 
mentioned in the introduction (see e.g., [22] and [60]). 

Definition 3.1. A function C : [0,1]" — > [0,1] is called a n-dimensional copula if 
it satisfies the following conditions: 

1. C(ui, . . . , Un) is increasing in each component Ui. 

2. C{ui, . . . , Ufe_i, 0, Wfc+i, . . . , Un) — for all G [0, 1], i 7^ fc, fc = 1, . . . , n. 

3. C(l, . . . , 1, Ui, 1, . . . , 1) — Ui for all Ui G [0, 1], i = 1, . . . , n. 

4. For aU (ai, . . . , a„), (&i, . . . , 6„) G [0, 1]" with a,; < bi, 

2 2 

E • • • E {-^y'^-+'"C{xu, , . . . , X.mJ > 0, 

where Xji — aj and Xj2 = bj for all j G {1, . . . ,n}. Equivalently, C is a n-dimen- 
sional copula if it is a joint cdf of n r.v.'s each of which is uniformly distributed on 
[0,1]. 

Definition 3.2. A copula C : [0,1]" [0,1] is called absolutely continuous if, 
when considered as a joint cdf, it has a joint density given by c?C"(ui . . . ,u„)/ 
dui ■ ■ ■ dun- 



F[Xj-^ , ■ 



(1+E E 5^l,...,^c(^^l,■ 

c=2 {n<...<ie}e-Bfc 
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Theorem 3.1 (Sklar [72]). If Xi, . . . , Xn are random variables defined on a com- 
mon probability space, with the one- dimensional cdf's Fxf.{xk) — P{Xk < Xk) and 
the joint cdf Fxi....,x„ixi, ■ ■ ■ ,Xn) = P{Xi < xi, . . . , Xn < Xn), then there exists 
an n-dimensional copula Cxi,...,x„ (wi, . . . , u„) such that Fxi,....x„{xi, ■ ■ ■ , Xn) — 
Cxu-:X„{Fxi{xi), . . . ,Fx„(x„)) for all Xk e Tl, k = 1, ... ,n. 

The following theorems give analogues of the representations in the previous 
section for copulas. Let Vi, . . . ,Vn denote independent r.v.'s uniformly distributed 
on [0,1]. 

Theorem 3.2. A function C : [0,1]" — > [0,1] is an absolutely continuous 
n-dimensional copula if and only if there exist functions gi^,...^i^ '. R*^ — > R, 1 < 
ii < ■ ■ ■ < ic < n, c — 2^ ...,n, satisfying the conditions 
A4 (integrability) ; 

1 .1 

• • • / \gii,...,ia{tii, ■ ■ ■ ,tij\dti^ ■ ■ -dti^ < 00, 
Jo Jo 

A5 (degeneracy); 

= / g^,,...,^AV^l,■■■,V^,-l,t^,,V^k + l,■■■,V^Jdt^^ =0{a.S.), 

Jo 

1 < ii < ■ • ■ < ic < n, fc = 1, 2, . . . , c, c = 2, . . . , n, 
A6 (positive definiteness): 

n 

Un{Vi,...,Vn) = Y^ J2 ~g^u■■■,^AV^,,■■■,V,J>-l{a.S.) 

c=2 l<ii < - <ie<n 



an 



d such that 



(3.1) C(mi,...,M„) = / •■•/ {l + Unitu...,tn))T[dt 

Jo Jo 



Theorem 13.21 and Sklar's theorem formulated above imply the following repre- 
sentation for a joint distribution of r.v.'s. 

Theorem 3.3. A function F : R" —>■ [0, 1] is a joint cdf with one- dimensional 
marginal cdf's Fk{xk), Xk G R, k = l,...,n, absolutely continuous with respect 
to the product of marginal cdf's Y[k=i Pk{xk) if and only if there exist functions 
gii,...,ia ■ [0, 1]*^ ^ R, 1 < ii < ■■■ < ic ^ n, c — 2, . . . ,n, satisfying conditions 
A4-A6 and such that the following representation holds for F: 



(3.2) F{xi,...,Xn) 



/ ••• / {l + Un{ti,...,tn))Y[dt„ 

Jo Jo 



or, equivalently, if and only if the element of joint frequency dF can be expressed 
in the form 

n 

dF{xi, ...,Xn) =\{dF^{xi){l + Un{Fi{xi), Fn{Xn))). 
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Remark 3.1. The functions g and g in Theorems l2.m3.3] are related in the following 
way: gi^,...,iSxn, ■ ■ ■ ,XiJ = gi-,,...,iSFii{xi^), ■ ■ ■ ,Fi^{xiJ). 

Theorems I2.1H3.3I provide a general device for constructing multivariate copu- 
las and distributions. E.g., taking in (13. 1|) and (|3.2p n ~ 2, 51,2(^1,^2) = a(l — 
2ti)(l — 2^2), a e [—1,1], we get the family of bivariate Eyraud-Farlie-Gumbel- 
Morgenstern copulas Cq(ui, W2) = uiU2(l + a(l — ui)(l — U2)) and corresponding 
distributions Fa{xi,X2) = i^i(a;i)i^2(a;2)(l + Q!(l — Fi{xi)){l — F2{x2))- More gen- 
erally, taking gi^^...^i^ {U^ , . . . , i^^) — 0, 1 < ii < ■ ■ ■ < ic < n, c = 2, . . . , n — 1, 
5i,2,...,n(^i, ^2, ■ ■ ■ ,tn) = "(1 — 2ii)(l — 2t2) • • • (1 — 2t„), we obtain the multivari- 
ate Eyraud-Farlie-Gumbel-Morgenstern copulas Cq(ui, U2, ■ ■ • , Wn) = YYi=i + 
aj|"^-^(l — Ui)) and corresponding multivariate cdf's Fa(xi,X2, ■ ■ ■ ,Xn) = 

nLi>,(x,)(i+anr=i(i-^.(^o))- 

Let aij....,,;^ £ R be constants such that X]c=2 Si<ii<...<i <n '^n, ^ 
(^ii • • • <5i, > -1 for aU £ {0, 1}, i = 1, . . . , n. The choice gii,...,iSUi-, ■ ■ ■ ,U,) = 
aii,...,i<,(l - 2iiJ(l - 2ti^) • • • (1 - 2tiJ, I < ii < ■ ■ ■ < ic < n, c = 2, . . . ,n, gives 
the following generalized multivariate Eyraud-Farlie-Gumbel-Morgenstern copulas 
(see Johnson and Kotz [45] and Cambanis [10]): 

n n 

(3.3) C{ui,...,Un)^'[[uk{l + Y^ a»i....,ic(l - WjJ) 

fc=l c=2 l<ii< - <ic<n 

and the corresponding cdf's 

n n 

The importance of the generalized Eyraud-Farlie-Gumbel-Morgenstern copulas 
and cdf's stems, in particular, from the fact that, as shown in Sharakhmetov and 
Ibragimov [70], they completely characterize joint distributions of two- valued r.v.'s. 

Taking n — 2, 31,2(^1,^2) — Oc{ti,t2), where c is a continuous function on the 
unit square [0, 1]^ satisfying the properties jj^ c{ti,t2)dti = c{ti,t2)dt2 = 0, 1 + 
6*0(^1, ^2) > for all < ii, ^2 < 1, one obtains the class of bivariate densities studied 
by Riischendorf [67] and Long and Krzysztofowicz [53] (see also [56, pp. 73-78]) 
f{xi,X2) — fi{xi)f2{x2){^ + Sc{Fi{xi), F2{x2))) with the covariance characteristic 
c and the covariance scalar 9. Furthermore, from Theorems 12. H - I5T51 it follows that 
this representation in fact holds for an arbitrary density function and the function 
9c{ti,t2) is unique. 

4. Prom dependence to independence through (7-statistics 

Denote by Qn the class of sums of f7-statistics of the form 

n 

(4.1) t/„(^i,...,^„) =^ J2 ff^l,...,^c(6l,•■•,6J, 

c=2 l<ii < - <ic<n 

where the functions gi^^...^i^, 1 < ii < ■ ■ ■ < ic < n, c ~ 2, . . . ,n, satisfy conditions 
A1-A3, and, as before, ^1, . . . are independent r.v.'s with cdf's Fk{xk), Xk G R, 
k = 1, . . . ,n. 

The following theorem puts into correspondence to any set of arbitrarily depen- 
dent r.v.'s a sum of [/-statistics in independent r.v.'s with canonical kernels. This 
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allows one to reduce problems for dependent r.v.'s to well-studied objects and to 
transfer results known for independent r.v.'s and {/-statistics to the case of arbi- 
trary dependence. In what follows, the joint distributions considered are assumed to 
be absolutely continuous with respect to the product of the marginal distributions 

Theorem 4.1. The r.v. 's Xi, . . . , X„ have one- dimensional cdf's Fk{xk), Xk G R, 
k = 1, . . . ,n, if and only if there exists Un G Gn such that for any Borel measurable 
function f : R" R for which the expectations exist 

(4.2) EfiXi, . . . ,X„) = . . . + Uni^i, . . . ,^„)). 



Note that the above Theorem 14. II holds for complex-valued functions / as well as 
for real-valued ones. That is, letting /(xi, . . . , Xn) = exp(i J2k=i ^kXk), tk €'R, k — 
1, . . . ,n, one gets the following representation for the joint characteristic function 
of the r.v.'s Xi, . . . , Xn ■ 

Eexp I i^tkXk I = Eexp I i^tk^k | +-Bexp ii^tk^k | Un{£,i, ■ ■ ■,^n)- 

\ k=l ) \ k=\ ) \ k=\ ) 



5. Characterizations of classes of dependent random variables 

The following Theorems I5.1H5.8I give characterizations of different classes of de- 
pendent r.v.'s in terms of functions g that appear in the representations for joint 
distributions obtained in Section 2. Completely similar results hold for the functions 
g that enter corresponding representations for copulas in Section 3. 

Theorem 5.1. The r.v. 's Xi, . . . ,X„ with one- dimensional cdf's Fk{xk), Xk G R, 
k — 1, . . . ,n, are independent if and only if the functions gi^,...,i^ in representations 
p.ip and (j2.2p satisfy the conditions 

9iu...,ia{^ii, ■ ■ • ,6o) = (a.s.), 1 < ii < ■ ■ ■ < ic < n,c^2, . . . ,n. 

Theorem 5.2. A sequence of r.v.'s {Xn} is strictly stationary if and only if the 
functions gi-^^,,,,i^ in representations ^2.1^ and ()2.2|) for any finite- dimensional dis- 
tribution (see Remark 12.1)] satisfy the conditions 

gii-i-h,...,i^+h{^ii,- ■ -^^ij = 9ii....Aj^n,- ■ • (as.) I <ii < ■ ■ ■ < ic < n, 

c = 2,3,...,/i = 0,l,... 

Theorem 5.3. A sequence of r.v. 's {Xn} with EXk — 0, EX^, < oo, fc = 1, 2, . . 
is weakly stationary if and only if the functions g in representations (j2.1[) and 
(I2.2p for any finite- dimensional distribution have the property that the function 
h{s,t) = E^s(.tgsti^s,^t), depends only on \t - s\, t, s = 1,2, . . . 

Definition 5.1. The r.v.'s Xi,...,Xn with EXi — 0, i — l,...,n, are called 
orthogonal if EXiXj ~ for all 1 < i < j < n. 

Theorem 5.4. The r.v. 's Xi, . . . , Xn with EXk — 0, k = 1, . . . ,n, are orthogonal if 
and only if the functions g in representations (j2.ip and (I2.2p satisfy the conditions 

Ei^iJg^J{i^,^J) = 0, 1 < i < j < n. 

Definition 5.2. The r.v.'s Xi, . . . ,Xn are called exchangeable if all n\ permuta- 
tions . . . , XTr(„)) of the r.v.'s have the same joint distributions. 
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Theorem 5.5. The identically distributed r.v. 's Xi, . . . , X„ are exchangeable if and 
only if the functions gi-^^,,,_i^ in representations (12. ip and (j2.2p satisfy the conditions 

5ii,...,ic(6i, ■ • ■ ,Cic) = ffi„(i),...,V(c)(?Mi)> • ■ • ("--s-) for all 1 < ii < ■ ■ ■ < 
ic < n, c = 2, . . . ,n, and all permutations n of the set {1, . . . ,n}. 

Definition 5.3. The r.v.'s Xi, . . . , X„ are called m-dependent [1 <m < n) if any 

two vectors {Xj^ , Xj^ , ■ • • , Xj^_^ , Xj^ ) and {Xj^^^ , -'^j^+a ? • • • ; ^ji-i : ^ji ) i where 
1 < ji <■■■< ja <■■■< ji < n, a = 1,2, ... ,1 - 1, I = 2, ... ,n, ja+i - ja > m, 
are independent. 

Theorem 5.6. The r.v. 's Xi, . . . , Xn are m-dependent if and only if the functions 
g in representations \2.1\ and (|2.2p satisfy the conditions gi-^^...^ii^^i^^-^^...^i^{^i-^, ... , 

^^k^£.^k + ll■■■'S,^J = 3»i fei + i ,.. .,Jo + i C»c ) V 0,11 I < il < 

■■ ■ < ik < ik+i < ■ ■ ■ < ic < n, ik+i - ik > m, k ^ 1, . . . , c - 1, c = 2, . . . , n. 

Definition 5.4. The r.v.'s Xi, . . . , Xn form a multiplicative system of order a G N 
(shortly, MS{a)) if |" < oo, j = 1, . . . , n, and n"=i X"' = lYJ=i EX"' for 
any Uj G {0, 1, . . . , a}, j = 1, . . . , n. 

The systems AjfS'(l) and AIS{2) under the names multiplicative and strongly 
multiplicative systems, respectively, were introduced by Alexits [2]. Multiplicative 
systems of an arbitrary order were considered, e.g., by Kwapieh [48] and Sharakhme- 
tov [68]. Examples of the multiplicative systems MS{1) are given, besides inde- 
pendent r.v.'s, by the lacunary trigonometric systems {cos27rnfca;,sin27i"nfeX, /c — 
1,2,.. .} on the interval [0, 1] with the Lebesgue measure for rik+i/rik > 2 impor- 
tant in Fourier analysis of time series and also by such important classes of depen- 
dent r.v.'s as martingale-difference sequences. Examples of strongly multiplicative 
systems (that is, the systems MS (2)) are given by the lacunary trigonometric sys- 
tems for Uk+i/rik > 3 and martingale-difference sequences Xi,...,X„ satisfying 
the conditions E{Xn\Xi, . . . , Xn-i) = 6^ G R, n = 1, 2, . . .. Examples of the sys- 
tems MS{a) include, for instance, the lacunary trigonometric systems with large 
lacunas, that is, with Uk+i/rik > a + 1 and also e-independent and asymptotically 
independent r.v.'s introduced by Zolotarev [78]. 

Theorem 5.7. The r.v. 's Xi, . . . , Xn form a multiplicative system of order a if and 

only if the functions gi-^ in representations ()2.ip and (|2.2p satisfy the conditions 

•■•^j"'"5»i,...,Jctei>---:CjJ = 0, 1 < il < • • • < ic < n, c = 2, . . . ,n, ttj G 
{0,1,..., a}, j = l,...,n. 

Definition 5.5. The r.v.'s Xi, . . . , Xn are called r-independent (2 < r < n) if any 
r of them of are jointly independent. 

Theorem 5.8. The r.v. 's Xi, . . . , Xn are r-independent if and only if the functions 
9ii,...,ic ™ representations ()2.1|) and ()2.2p satisfy the conditions (Cii , . . . , 

Cic) — (a.s.), 1 < il < • • • < ic < n, c = 2, . . . , r. 

Remark 5.1. Let i^i(x), . . . , Fn{x) be arbitrary one-dimensional distribution func- 
tions, ai,...,a„ G (-1,1) \{0}, J2"=i\'^i\ < 1- Taking gi^^,,,^iXti^, . . . ,ti^) = 0, 
1 < il < ••• < ic < n, c = 2,...,n, c ^ r -|- 1, gi^„,,^i^_^^{ti^, . . . ,ti^^^J = 
C::Z+^ ^0' + ^K - {k + 2)t':f)---i{k + l)tl - (fc + 2)<+i), k = 0,1,2,..., in 
Theorem 13.31 we obtain the following extensions of the examples of r-indcpendcnt 
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r.v.'s obtained by Wang [77]: For fc = 0, 1, 2, . . ., 

F(xi, . . .,x,,) =f[F,{x,) [ 1 + {F^{x.,)^F^\x.J) . 

1=1 \ l<li<...<4r+l<» 



(Wang's examples are with fc = 0). 



6. Further applications: a structural property of multiplicative systems 

The following theorem shows that r.v.'s forming a multiplicative system of order a 
and taking not more than a+1 values are jointly independent. Let card{Ai) denote 
the number of elements in (finite) sets Ai. 

Theorem 6.1. Let a G N, and let Ai, i = 1, . . . ,n, be sets of real numbers such that 
card{Ai) < a + 1, i — 1, . . . ,n. The r.v. 's Xi, . . . , Xn taking values in Ai, . . . , An, 
respectively, form a multiplicative system of order a if and only if they are jointly 
independent. 

Remark 6.1. From Theorem 16.11 with a — 1 the following result obtained in [70] 
follows: A sequence of r.v.'s {Xn} on a probability space (fi, 3, P) assuming two 
values is a martingale-difference with respect to an increasing sequence of cr-algebras 
So = {p., 0) C C • ■ • C if and only if the r.v.'s {Xn} are jointly independent. 
In addition, we obtain that if a sequence of r.v.'s {X„} assuming three values is a 
martingale-difference with respect to (S„) such that i?(X^|S„_i) = 6^ G R, then 
the r.v.'s are jointly independent. 

7. Measures of dependence and sharp moment and probability 
inequalities for dependent random variables 

In this section, we apply the results from Section 2 to study properties of differ- 
ent measures of dependence and convergence of multidimensional statistics of time 
series. We obtain results that allow one to reduce the analysis of convergence of 
statistics of time series to the study of convergence of the measures of intertem- 
poral dependence in the time series and limit behavior of the statistics in the case 
of independence. We also prove new sharp complete decoupling inequalities for de- 
pendent r.v.'s in terms of their dependence characteristics. The theory of complete 
decoupling inequalities has experienced an impetus in recent years. The interested 
reader should consult de la Peha [14], de la Peha and Gine [15] and de la Peha 
and Lai [18] (a survey) for more on the subject. Let Xi, . . . , Xn be r.v.'s with one- 
dimensional cdf's Fk{xk), k = \,...,n, and joint cdf F(x\, . . . ,x,-^. Recall that 
G(x\, . . . ,Xn) = dF(xi, . . . ,Xn) / YYi^i dFi and consider the following measures of 
dependence for the r.v.'s Xi, . . . , Xn ■ 

,Xn)dF{xi, . . .,Xn) - 1 
n 

■ ,Xn)WdFi{Xi) - 1 

1=1 




194 



V. H. de la Pena, R. Ibragimov and S. Sharakhmetov 



(multivariate analog of Pearson's cp'^ coefficient), and 

/oo /"OO 
■•• / log{G{xi, . . . ,x„))dF{xi, ...,x„) 
-oo J — oo 

(relative entropy), where the integral signs are in the sense of Lebesgue-Stieltjes 
and G{xi , . . . , Xn) is taken to be 1 if (a^i , . . . , a;„) is not in the support of dFi ■ ■ ■ dFn. 
In the case of absolutely continuous measures 6xi,...,x„ and 

^^Xi x„ were introduced by Joe [42, 43]. In the case of two r.v.'s Xi and X2 
the measure -^^ was introduced by Pearson [63] and was studied, among oth- 
ers, by Lancaster [49-51]. In the bivariate case, the measure 5xi,X2 is commonly 
known as Shannon or KuUback Lcibler mutual information between Xi and X2. 
It should be noted (see [43]) that if {Xi,...,X„y ~ N{n,Y,), then = 

|ii(27„ -ii) I -1/2-1, where 7„ is the nxn identity matrix, provided that the corre- 
lation matrix R corresponding to E has the maximum eigenvalue of less than 2 and 
is infinite otherwise {\A\ denotes the determinant of a matrix A). In addition to that, 
if in the above case diag(S) = {af, . . . ,cr^), then = ~-51og(|S|/f|^^ erf). 

In the case of two normal r.v.'s Xi and X2 with the correlation coefficient p, 
{^\,,xj{^ + ^\,,x.))"' = (1 - exp(-2<5^,^J)V2 = 

The multivariate Pearson's coefficient and the relative entropy are particular 
cases of multivariate divergence measures D'^^ = • • • il){G{xi, . . . , 

Xn)) nr=i dFi{xi), where V is a strictly convex function on R satisfying ^(1) = 
and G(xi, . . . , a:„) is to be taken to be 1 if at least one xi,...,Xn is not a point 
of increase of the corresponding Fi, . . . , Fn- Bivariate divergence measures were 
considered, e.g., by Ali and Silvey [3] and Joe [43]. The multivariate Pearson's (j)"^ 
corresponds to ip{x) = x"^ — 1 and the relative entropy is obtained with ij{x) = 
x\ogx. 

A class of measures of dependence closely related to the multivariate divergence 
measures is the class of generalized entropies introduced by Tsallis [75] in the study 
of multifractals and generalizations of Boltzmann-Gibbs statistics (see also [24, 26, 
27]) 

/^x!,...,x„ = T— / G^-%Xr,...,Xn))^dFi{x,), 
^ Q J— 00 J— 00 

where q is the entropic index. In the limiting case g ^ 1, the discrepancy measure 

p^"^^ becomes the relative entropy Sx^ x„ and in the case g ^ 1/2 it becomes the 

scaled squared Hellinger distance between dF and dFi ■ ■ ■ dFn 

-1 poo />oo ^ 

P^xClx„ = :^i^- ••• / G'/\x^,...,Xn))l[dFi{x,)) = 2H%^_x^ 

{Hxi x„ stands for the Hellinger distance). The generalized entropy has the form 

of the multivariate divergence measures i)^^ ... x„ with^(x) = {l/{l—q)){l—x^~'^). 

In the terminology of information theory (see, e.g., Akaike [1]) the multivariate 
analog of Pearson coefficient, the relative entropy and, more generally, the multi- 
variate divergence measures represent the mean amount of information for discrim- 
ination between the density / of dependent sample and the density of the sample of 
independent r.v.'s with the same marginals fa — Y[k=i .fk{xk) when the actual dis- 
tribution is dependent /(/o, /; $) = / ^{f{x)/fo{x))f{x)dx, where $ is a properly 
chosen function. The multivariate analog of Pearson coefficient is characterized by 
the relation (below, /o denotes the density of independent sample and / denotes 
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the density of a dependent sample) (j)-^ — /(/o,/;$i), where = x; the rela- 

tive entropy satisfies 6 — /(/o,/;$2), where $2(2;) — log(a;); and the multivariate 
divergence measures satisfy ^ — I{fo, /, $3), where $3(x) = ip{x)/x. 

If gii....,iai^in ■ ■ ■ j^ic) functions corresponding to Theorem 12.11 and 

Remark [2.21 then from Theorem l4.1l it follows that the measures Sxi,...,x„ , x > 

-Oxi,...,x„' Pxl,...,x„ (™ particular, 2nx^ .^x„ for q = 1/2) and /(/o,/;$) can be 



written as 



(7.1) 



(7.2) 



5xi,...,x„ = E\ogil + Un{Xi,...,Xn)) 

= E{1 + Uniil, . . . , in)) log(l + (7„(6, 



^Xl,...,x„ = £;(l + (7„(6,...,?„))'-l 

= EUl{^^,...,U) = EUr^{X^,...,X^), 



(7.3) =i?V(l + t/«(Ci,...,?»)), 

(7.4) Pt,...,x,. = (1/(1 - 'Z))(l - ^^(1 + f^«(6, • • • ,Cn))^), 

(7.5) 2H^^^...^^,^ = 1/2(1 -£;(l + ?7„(a,...,e«))'/'), 

(7.6) /(/o, /; <!>) = £;$ (1 + ;7„(ei, . . . , U)) (1 + C^,.(6, • ■ ■ , , 

where t/„(xi, . . . ,a;„) is as defined by (4.1). 

From (|7.2p it follows that the following formula that gives an expansion for 
in terms of the "canonical" functions g holds: (f)\_^ = 

Ec=2Ei<ii< - <ic<n^ff^^i,- -.ic(^»i'-- In particular, in the case of the r.v.'s 
Xi , . . . , Xn with the generalized multivariate Eyraud-Farlie-Gumbel-Morgenstern 

,x^ — 



copulas (|3.3p the measure of dependence is given by (f)j^_^ 

X]c=2 X^i 



2 



It is well known that the mutual information between two r.v.'s Xi and X2 is 
nonnegative (see [12, p. 27]). The multivariate analog of this property for Sxi,...,x„ 
follows from the results obtained by Joe [43]. It is interesting to note that non- 
negativity of 5xi,...,Xn can be easily obtained from ()7.ip : since the function (1 + 
a;)ln(l -I- x) is convex in a; > 0, by Jensen inequality we get Sxi....,x„ > (1 + 
EUn{£,i, ■ ■ ■ , Cn)) log(l -I- EUni^i, ■ ■ ■ , ^n)) = 0. The following theorem gives an in- 
equality between the measures 5xi,...,x„ and 4'xi x„ that generalizes and im- 
proves the results obtained, in the bivariate case, by Dragomir [20] (see also Mond 
and Pecaric [59]). 

Theorem 7.1. The following inequalities hold 

Sx,,...,x^ < log(l + < </'i-i,...,x„- 

The results obtained in the previous sections, in particular. Theorem 14.11 provide 
a device for reducing the analysis of convergence in distribution of double arrays of 
dependent variables to the study of the convergence in distribution of a decoupled 
counterpart plus a measure of dependence in the time series. We apply this idea in 
reduction of the analysis of convergence of multidimensional statistics of time series 
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to the study of convergence of the measures of intertemporal dependence of the time 
series, including the above multivariate Pearson coefficient 0, the relative entropy S, 
the divergence measures D'^ and the mean information for discrimination between 
the dependence and independence /(/o, /; We obtain the following Theorem l7.2l 
which deals with the convergence in distribution of m-dimensional statistics of time 
series. 

Let h : R"* ^ R be an arbitrary function of m arguments, Y be some r.v. and 
let ijj he a convex function increasing on [l,oo) and decreasing on (— oo, 1) with 

■0(1) — 0. In what follows, — > represents convergence in distribution. In addition, 
{Ci} and {^t} stand for dependent copies of {X"} and {Xt}. 

Theorem 7.2. For the double array {X^}, i — 1, . . . ,n,n = 0, 1, . . . let junc- 
tionals 4>l^,, = 4>\^,X^,...,XZ, 5n,n = 5x^,XJ^,...,X-^, D^^^ = ^X^" , . . ' P^l = 

Px^" X" X"' 1 ^ -'^)' '^n.n = n = 0, 1, 2, . . . denote the correspond- 

ing distances. Then, as n ^ oo, if 



and either cfy^ „ ^ 0, 8n,n 0, I?^„ 0, p"n}n — *■ or 7i„_„ ^ as n ^ oo, then 
as n —>■ oo, 

n 

For a time series {Xt}'^Q let the Junctionals = 'p'xt Xt+i Xt+m~i^ ~ 
5xt.Xt+i,....Xt+„,^i, Df = £'xj,Xt+i,...,Xt+„_i' Pt^^ = Pxixt+i,...,Xt+,„_i' 9 ^ (0,1), 
Tit = {\/2pf^y-/'^ , t ~ 0,1,2,... denote the m-variate Pearson coefficient, the 
relative entropy, the multivariate divergence measure associated with the function 
"0, the generalized Tsallis entropy and the Hellinger distance for the time series, 
respectively. 



Then, if, as t ^ oo, 



and either (pl Q, 5t ^ 0, of 0, p'f'' ^ or Tit ^ as t ^ oo, then, as 
t oo, 

h{Xt, Xt+l, . . . , Xt+rn-l) Y. 

From the discussion in the beginning of the present section it follows that in the 
case of Gaussian processes {Xt}^Q with (Xt,Xt+i, Xt+m-i) ~ N{nt,m, ^t.rn), 
the conditions of Theorem [721 are satisfied if, for example, |i?t_m(2/m — -Rt,m)| — > 1 
or \^t.m\/ J27Lo^ '^t+i ^ 1, as t ^ oo, where Rt.m denote correlation matrices 
corresponding to and (cr^, . . . , cft+m-i) — diag(Sf In the case of processes 
{Xt}'^i with distributions of r.v.'s Xi, . . . , X„, n > 1, having generalized Eyraud- 
Farlie-Gumbel-Morgenstern copulas p.3p (according to [70] , this is the case for any 
time series of r.v.'s assuming two values), the conditions of the theorem are satisfied 

if, for example, 0? = E" 2 E,;i<...<»,e{t,t+i,...,t+m-i} "L...,»c ^ as t ^ 00. 

It is important to emphasize here that since Theorem 17.21 holds for Tsallis en- 
tropy and multivariate divergence measures, it allows one to study convergence of 
statistics of time series in the case when only lower moments of the above-mentioned 
tZ-statistics underlying the dependence generating process for the time series exist. 
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Therefore, they provide a unifying approach to studying convergence in "heavy- 
tailed" situations and "standard" cases connected with the convergence of Pearson 
coefficient and the mutual information and entropy (corresponding, respectively, to 
the cases of second moments of the {/-statistics and the first moments multiplied 
by logarithm). 

The following theorem provides an estimate for the distance between the dis- 
tribution function of an arbitrary statistic in dependent r.v. 's and the distribution 
function of the statistic in independent copies of the r.v.'s. The inequality com- 
plements (and can be better than) the well-known Pinsker's inequality for total 
variation between the densities of dependent and independent r.v.'s in terms of the 
relative entropy (see, e.g., [58]). 

Theorem 7.3. The following inequality holds for an arbitrary statistic h{Xi, . . . , 
Xn): 

|P(M^l,...,X„) <X) -P(Ma,...,^n) <X)| 



< 0Js:i,...,x„ max 



The following theorems allow one to reduce the problems of evaluating expecta- 
tions of general statistics in dependent r.v.'s Xi, . . . , Xn to the case of independence. 
The theorems contain complete decoupling results for statistics in dependent r.v.'s 
using the relative entropy and the multivariate Pearson's (j)^ coefficient. The results 
provide generalizations of earlier known results on complete decoupling of r.v.'s 
from particular dependence classes, such as martingales and adapted sequences of 
r.v.'s to the case of arbitrary dependence. 

Theorem 7.4. // / : R" ^ H is a nonnegative function, then the following sharp 
inequalities hold: 

(7.7) i?/(Xi, . . . ,X„) < i?/(^i, . . . ,^„) + <l>x^,...,xSEfHii. ■ ■ ■,£.n)Y'\ 

(7.8) i?/(Xi, ...,Xn)<{l + 0i„...,xJ'/'(^^/'(Ci, • ■ -,60)'/^ 1 > 2, 

(7.9) EfiXi, . . . ,X„) < i?exp(/(a, . . . ,e«)) - 1 + 5x,,...,x^, 

(7.10) Ef{x,, . . . , x„) < (1 + . . . , e«))'/^ q > 1, 

where ?/;(x) = IxI'/^'-i) - 1. 

Remark 7.1. It is interesting to note that from relation (|7.2p and inequality (|7.7p 
it follows that the following representation holds for the multivariate Pearson coef- 
ficient 0Xi,....x„: 

.7,,. , ^ (Jg/(Xi,...,x„)-i;;/(6,...,e„)) 

(7.11) 0xi,...,x„ = max 77T7I77 c \\i/2 ■ 

EPii,,...,i„)<oo 



The following result gives complete decoupling inequalities for the tail probabil- 
ities of arbitrary statistics in dependent r.v.'s. 
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Theorem 7.5. The following inequalities hold: 

P (hiXi, . . . , Xn)>x)<P{hi^,, . . . , C„) > + {Pmu • • • , Cn) > X))' 

P{h{X^,...,Xn)>x) < (1 + 0^,,...,;,J'/' (P(Mei,---,Cn) >x))^/^ 

P(/i(Xi,...,X„) >.x) < (e-l)P(/i(ei,...,C„) >.x)+5x,,...,x„, 



P{h{Xi,...,Xr,)>x) < + 
a; e R, w/iere ipix) = |a;|«/(«-i) - 1. 



(P(/i(Ci,...,U >a;))' ,g> 1, 



8. Appendix: Proofs 

Proof of Theorem \2.1\ Let us first prove the necessity part of the theorem. Denote 



T{X\ , . . . , Xyi) 



{I + Un{tl, . . . ,tn))\{dF,{U). 



Let k e {1, . . . , n}, Xfc G R. Let us show that 

(8.1) T{oo, . . . , CX3, a;fc, oo, . . . , oo) = Fk{xk), 

G R, A; = 1, . . . , n. It suffices to consider the case fc = 1. We have 
T{xi, cxo, . . . , oo) 

/ •••/ {l + Un{h,...,tn))Y[dF,{U) 

-OO J — OG J — OO • 1 



i=l 



= Pl(xi)+^ //•••/ 9^^,...,^At^,,■■■,UJY[dF,{U) 

c=2 l<ii< . <i,<n ^-°° i=l 

= Pi(xi) + E". 

It is easy to see that there is at feast one tg oi t2, ■ ■ ■ , tn among the arguments of 
each of the functions gii,...,i^(ti^, . . . ,ti^), 1 < ii < ■ ■ ■ < ic < n, c = 2, . . . ,rt, 
in the latter summand. By A2 we get, therefore, that S" = 0. Consequently, 
T{xi, oo, . . . , oo) — Fi{xi), xi G R, and (|8.ip holds. It is evident that 



lim T{xi,...,Xk,-.-,Xn) = 



(8.2) 

for all a;j G R, J = 1, . . . , n, J ^ fc, fc = 1, . . . , n. Since 



T(xi, . . . ,Xn) = ]^-Fi(a;j) + E 



from the monotone convergence theorem we obtain that T(a;i, . . . , a;„) is right- 
continuous in (xi, . . . ,a;„) G R". Let 5^^^^^■^T{xl, . . . ,x„) = T(xi, . . . , a;fc_i, 5, x^+i, 
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. . . , Xn) — T{xi, . . . , ifc-i, a, Xk+1, ■ • ■ , Xn), a < b. By integrability of the functions 
Slid condition A3 we obtain (/(•) denotes the indicator function) 



(8.3) 



= Yl P{a, < <b,)+E 



> 



for all fli < 6i, z = 1, . . . , n0 Right-continuity of T(a;i, . . . , x„) and (|8.ip - (|8.3p imply 
that T(xi, . . . , x„) is a joint cdf of some r.v.'s Xi, . . . , X„ with one-dimensional cdf's 
Fk{xk), and the joint cdf T{xi, . . . , x„) satisfies (12.11) . 

Let us now prove the sufficiency part. Consider the functions 

s=2 ii<---<j;,G{»i,...,»c} ^ ^ 

1 < ii < ■■■ < ic < n, c = 2, . . . ,n. Obviously, the functions fi-^^...^i^ satisfy 
condition Al. Let us show that they satisfy condition A2. It suffices to consider the 
case ii — l,i2 — 2, . . . ,ic = c,k = 1. We have 

Egi,2,...A^i^^2, ■■■,Xc) 

3i,2,...,c(a;i,a;2, • ■ .,Xc)dFi{xi) 



, 7 ■ ■ ■ 7 ) 



- 1 



E 

2<ii<-<is<c 



V dFidF,^ ■ ■ ■ dF, 



V dF,,dF,^---dF,, 



dFiixi) 



= E(-i)- E ( 



22 7 ... 5 '^Is 



s=2 



,2<i2<-<js<c 



dF^---dF„ 



1) 



E ( 

2<2i<---<2s<C 



dF{xi^ , . . . , ) 
~lLF~^dF~ 



1)U0. 



By the inversion formula (see, e.g., [8, pp. 177-178]) it follows that if ai^^,,,^i^, 
1 ^ ii < ■■■ < *c £ n, c = 2,...,n, are arbitrary numbers then the 

relations 

n 

= E E 1 < ii < • • • < «c < c = 2, . . . ,n, 

c=2 ji<...<j,6{ii,...,ij 

and 



Y.(-^)- E 



, 1 < ii < • • • < ic < ^, c = 2, . . . , n. 



s=2 



jl<---<j3e{ii,...Aa} 



^Note that Il8.2|l and Il8.3|l are immediate if the probabihty space and the random variables 
are defined in the canonical way with Q = R" and Xi{Lu) = uii for lo = [loi, . . . ,t<Jn) and P{A) = 
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are equivalent. Taking ai^^,,,,i, = gi^,,,,^i^{xi^, . . . ,XiJ, bi^^,,,^i^ = dF{xi^, . . . ,XiJ/ 
Uj=i - 1, for 1 < ii < • • • < < n, c — 2, . . . ,n, we obtain, in particular, that 

(8.4) G{xi, ...,Xn) = l + Un{xi, . . . ,X„). 

Therefore, representation p.ip holds and the functions fi^,...^i^ satisfy condition A3. 
Suppose now that there exists another set of functions gi^,...^i^ satisfying conditions 
A1-A3 and such that (|2.ip holds and, equivalently, (|2.2p holds. Let Bs be a set of 
s integers ji , . . . , js with 1 < ji < • • • < Js < n, for s = 2, . . . , rt. By Remark 1 we 
have 
(8.5) 

s s 

^ ^ ^ ^ ,...,ic (Cil ; ■ ■ ■ ; Cic) ~ ^ ^ 5ii , • • .,ic (Cii 7 ■ ■ • i ) 

c=2 {ji<...<iJgB^ c=2 {ii< - <jJeS, 

(a.s.). From (|83|) we subsequently obtain that /ii,i2(Cn , ^12) = 5ii,i2 (61)^^2) (a.s.), 
1 < zi < 22 < n; /ii,i2,i3(Cii>Ci2>Cj3) = .9ji,i2,i3(Cn>Ci2)63) (a-S-)> 1 < «i < 42 < «3 < 

/l,2....,n(6,C2, • ■ • ,Cn) = 5l,2,. . . ,ri (Cl , 6 , ■ • ■ , Cn) (a.S.), that is gii,. (^i , . . . , 

Cic) = fiu...AAiiii ■ ■ ■ i^ia) (a.s.), 1 < ii < ■ ■ ■ < ic < n, c ^ 2, . . . ,n. This completes 
the proof. □ 

Proof of Theorems \S.2\\4.1\ By definition, a function C : [0, 1]" ^ [0, 1] is a 
n-dimensional copula if and only if it is a joint cdf of n r.v.'s each of which is 
uniformly distributed on [0, 1]. Let, as in Section 3, Vi, . . . ,Vn denote independent 
r.v.'s with uniform distribution on [0,1]. From Theorem 12.11 we obtain that C : 
[0, 1]" — > [0, 1] is an absolutely continuous copula if and only if there exist functions 
■ R."^ — > R, 1 < ii < • ■ ■ < ic < c = 2, . . . , n, satisfying the conditions 

E\gi,,...,t^{Vi^,...,ViJ\ < oo, 

i?(5n...,^e (V,,,..., V,,_, , V,, , V,,^, ,...,VaVn,..., V,,_, , V,,^, , . . . , 1/,J = (a.s.), 
1 < ii < ■ ■ • < ic ^ k — 1,2, ... ^c^ c = 2, . . . , n, 

n 

5ii....,»<=(V;i,...,FiJ > -1 (a.s.), 

c=2 l<ji <...<ic<n 

and such that representation p.ip holds. This proves Theorem l3.2l Theorem [331 fol- 
lows from Theorem l3.2l and the relation (given by Sklar's Theorem l3.1l) Fxi,...,x„{xi, 
■ ■■,Xn) = Cxt,...,x„{Fxi{xi), ■ ■ ■ .FxS^n)), Xfe S R, 1 , . . . , n, between the 
joint distribution functions Fxi,...,x„(a;i, . . . , a;„) and the corresponding copulas 
Cxi,...,x„(wi, . . . , M„). If J7„ is the JJ-statistic corresponding to the r.v.'s Xi, . . . , X„, 
then for any Borel measurable function / for which the expectations exist, one has 

n 

ii;/(Xi,...,x„) = i?{/(ei,...,60(i + E E 5»i,....vtei,---,6J)} 

c=2 l<ji< - <ic<n 

= i?[/(a,...,60(i + t/«(a,---,Cn))]- 

This and Theorem 12. II implies Theorem 14.11 □ 

Proof of Theorem ] 5. 1\ It is evident that (a^ii , . . . ,Xi^) =0 satisfy condi- 

tions Al~A3. If gii,...,i^(a;ij, . . . ,Xi^) = 0, 1 < ii < • • ■ < ic < n, c = 2, . . . , n, then 
representation (12. ip takes the form 

n 

(8.6) F(a:i,...,a;„) ^n-F^a:,). 
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R.v's with the joint distribution function (j8.6|) are independent. Let now Xi, . . . , Xn 
be independent r.v.'s with one-dimensional distribution functions Fi{xi),i — 
1, . . . , n. Then their joint distribution function has form (|8.6p . This and the unique- 
ness of the functions 3,1, given by Theorem 12.11 completes the proof of the 
theorem. □ 



Proof of Theorems 1 5. S\\5. ^1 Below, we give proofs of Theorems 15.71 and Ej 
The rest of the theorems can be proven in a similar way. Let Xi, . . . , Xn be r.v.'s 
with the joint distribution function satisfying representation (j2.2p with functions 

5ii,...,ic such that ^^4"'' • ■ ■C,"'"5ii,...,j<:tei7 • ■ ■ ,Cjc) = 0, 1 < ii < • • • < < 
c = 2, . . . , n, where ^1, . . . , ^„ are independent copies of Xk-, k — 1,2, ... ,n. Let 
aj G {0, 1, . . . , a}, j = l,...,n. Taking in (g^l) f{xi, . . . , x„) = n"=i ^"j' and 
using independence of the r.v.'s ^1, . . . , ^„ we obtain 

n n n n 

n n c 

(8.7) =n^c+E E ^nc3n,...,..(6.,...,^^j 

j=l c=2 l<ii< - <ie<n fc=l 

X n ^^r- 

fc=l,...,Tl 

^7^21 ,...,ic 

Using •••C3n,...,»c(6i,...,6:J - 0, we get from (E3 EU^^^X^^ = 

JIJ^j^ -^^7^ = Yij=i ex"' , that is the r.v.'s Xi, . . . , Xn form a multiplicative sys- 
tem of order a. Let us suppose now that r.v.'s Xi, . . . ,Xn form a multiplicative 
system of order a, that is, E\Xj |" < 00, j = 1, . . . ,n, and for all aj G {0, 1, . . . , a}, 
j = l,...,n, E HjLi = n"=i EX"' . From Remark O and Theorem O it 
follows that 

r=l r=l c=2 ii<...<i^e{ji,...j^} r=l 

ttjV £ {Oj Ij • • ■ I 1 < Ji < • • • < Jfc < = 2, . . . , rt. Therefore, for all aj^ € 
{0,1, . . .,a}, 1 < ji < ■ ■ ■ < jk < n, k ^ 2, . . . ,n, 

(8-8) E E ^^nc^n...... n ^e=o- 

C=2»i<...<ie€{jl,...,J»} »-=l r=l,...,fc 

From (|8.8p we subsequently obtain that 

^C'C5ji,j2(Cji,C»2) =0,aj,,a,2 e {0,l,...,a},l < ii < 12 < n; 

eCi' 9ti,z2,t3 tel > C»2 J 63) = 0, , , a^,, e {0,1, . . . ,a},l < ii < 12 

<i3<n and E£,"^ ■ • • C""gi,2,...,«(Ci, 6) ■ • ■,Cn),ak G {0,1, ... ,a},k = 1, ... ,n. 

Therefore, -B^"'' • • • C'.9ji,...,ictei : ■ • ■ : Cic) = 0, 1 < ii < • • • < ic < n, c = 
2, . . . , n. Let Xi, . . . , X„ be r-independent r.v.'s. From Remark |2. II it follows that 

F(xj-^, . . . , Xji,) 
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for all = {1 < ji < ■ ■ ■ < jk < «}, k — I, ■ ■ ■ ,r. Using r- independence of X^s 
we subsequently obtain from here that (741.42(^411^12) — (a.s.), 1 < «i < «2 < 

9^u^2,^3{^^l,S.^2l£,^3) = (a-S.), 1 < «1 < «2 < 43 < • ■ • , 5*1 , . . .,V fei , • ■ • , C» J = 

(a.s.), 1 < ii < ■ ■ ■ < ir < n. Therefore, , ■ • ■ , ^v) = (a.s.), 1 < 

ii < ■ ■ ■ < ic < n, c — 2, . . . ,r. Let now Xi, . . . , Xn be r.v.'s such that the func- 
tions gij^^...,i^ in representations (j2.ip and (|2.2p satisfy the conditions gi^^...^i^ (^i^ , ■ . • 7 
^i^) = (a.s.), 1 < ii < • ■ • < ic < c = 2, . . . ,r, that is their joint distribution 
function has the form 

F{^X\ , . . . , Xji) 

= P{Xi <Xi,...,Xn<Xn) 

•••/ (1+ E E 9^u■■■.^AU,,■■■,t^J)l[F,{U). 

-00 c=r+l l<ii<...<,;^<n i=l 

Let 1 < Ji < • • • < jr < f^. Let us show that the r.v.'s Xj-^ , ■ ■ ■ , Xj^ are jointly inde- 
pendent. Without loss of generality, it suffices to consider the case ji = 1, . . . ,jr = r. 
We have, similar to the proof of Theorem l2.11 

F(^Xi , . . . , X|.) 

^P{Xi < XI,. 

— - ^ y: e 5n,.....(^.,...,tj 

c— r+l l<ii<---<ic<ri 




X \{dF,{U) 

i=l 

r n n 

= X{F,{Xi)+ J2 E 9^^,...,^At^l,■■■,t^Jl[F,{U) 

r 

^1[f,{x,) + j:". 

1=1 

It is easy to see that there is at least one of i^+i, ■ ■ ■ ,tn among the arguments of 
each of the functions gi^^...,i^ {tii , ■ ■ ■ , U^), 1 < ii <■■■< ic < n, c — r + 1, ... ,n, in 
the latter summand and, therefore, by A2, E" = 0. Consequently, F{xi, . . . ,Xr) — 
Yii^i Fi{xi) that establishes joint independence of the r.v.'s Xi, . . . , Xr. The proof 
is complete. □ 



Proof of Theorem \ 6.1\ Evidently, if the r.v.'s Xi, . . . ,X„ are jointly indepen- 
dent, then they form a multiplicative system of order a. Let us show that if 
card{Ai) < a + 1, i = 1, . . . , n, and r.v.'s Xi, . . . , Xn form a multiplicative sys- 
tem of order a, then they are jointly independent. It suffices to show that 

n n 

(8.9) El[f,{X,)^l[Ef,{X,) 

1=1 i=l 

for all continuous functions : R ^ R, i = 1, . . . , n, vanishing outside a finite 
interval. Let 1 < i < n. It is easy to see that if fi{x), a; G R, is an arbitrary 
function, then there exists a polynomial ri{x), a; G R, of degree not greater than 
a, such that fi{x) — ri{x), x G Ai, that is fi{^i) — ?'i(^i) (a.s.). Using Theorems 
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14.11 and 15. 7[ we get that for all continuous functions : R ^ R (below, (x) are 
polynomials corresponding to fi(x)) 

n n n n 

n n n 

i^l c^2 l<'ii<---<ic<n 2—1 

n 

The proof is complete. □ 

Proof of Theorem \ 7.1\ From concavity of the functions log(l + x), relation 
(|7.2p and the inequaUty log(l + x) < x, x > 0, we get Sxi,...,x„ — -Elog(l + 
t/„(Xi,...,X„)) < log(l + i?;7„(Xi,...,X„)) = log(l + 02^^,'. .,^J < 
The proof is complete. □ 

Proof of Theorem Let h{^t, S.t+u 6+™-i) ^ Y. By Theorem O we 

have that for any continuous bounded function g : R ^ R 

EgihiXt, . . .,Xt+.ra-i)) = Egihi^u . . . + [/„,(6, . . . ,6+m-l)). 

By Chebyshev's inequality and (|7.2p we have that for all e > 0, 

p(|c/„(Ct,6+i,---,6+m-i)l >e) <0?A'. 

Since the function w{x) = (l+a^) ln(l+a;)— x is increasing in x £ [0, oo) and decreas- 
ing in a; e (-oo,0), we have that ifUmi^t,^t+i, ■ ■ • ,6+m-i) > e or Um{£.t,£,t+i, ■ ■ ■ , 
6+m-i) < -e, then 

w{U„r{^t,£,t+i, ■ ■ . ,Ct+m-i)) > A w{-e)), 

where a Ab = min(a,6). Therefore, by Chebyshev's inequality, (|7.ip and since 
EU,n{£.t, ■ ■ ■ ,6+m-i) = (by condition A2) we get, for < e < 1, 

P(|t/™(6,...,6+m-i)| >e) 

<P(u;(t/™(6,...,6+m-i)) > (wie) Aw{-e))) 
< Ew{U^iit, 6+1, ■ • ■ , 6+m-i))/(u'(e) A wi-e)) 

X log(l + U„,iit, . . . , 6+m-i)) /{w{e) A wi-e)) 
= St/iw{e)Aw{~e)). 

If e > 1, Chebyshev's inequality and Um{S.t, ■ ■ ■ ,$t+m-i) > ^1 yield 

(8.11) P(|[/™(6, . . . , 6+™-i)l > < ^^(^'»fe'y-'^*+'»-i)) = s,/w{e). 

w(e) 

Similar to the above, by Chebyshev's inequality and (|7.3p . for < e < 1, 

P(|t/™(6,...,6+™-i)l >e) < P(VXl+(7m(6,---,6+m-i)) > (^(l + e)A7A(l-e)) 

(8.12) < E^il + Urniit, . . . ,6+™-i))/(V'(l + e) A ^(1-e)) 

^Df/m + e)Aij{l-e)). 
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For e > 1, 



P(|t/™(6,...,6+™-i)| >e) < P(V'(l + (7™(6,...,6+m-i))>^(l + e)) 

< Dfm + e). 



Inequalities (|8.10p - (|8.13p imply that Um{^t,£,t+i, ■ ■ ■ ,Ct+m-i) ^ (in probabil- 
ity) as t — > oo, if 0j — > 0, or 6t 0, or of as t ^ (x. The same argu- 
ment as in the case of the measure of, used with iIj{x) — x^^'^, establishes that 
Umi^t,^t+i, ■ ■ ■ ,Ct+m-i) (in probability) as t ^ oo, if p[''^ ^ as t ^ oo 
for q £ (0,1). In particular, the latter holds for the case q = 1/2, and, conse- 
quently, for the Hellinger distance Tit. The above implies, by Slutsky theorem, that 
Eg{h{Xt, Xt+i, . . . , Xt+m^i)) — *■ Eg{Y) as t oo. Since this holds for any contin- 
uous bounded function we get /i(Xt,Xt+i, . . . ,Xt+rn-i) Y (in distribution) 
as t — > oo. The proof is complete. The case of double arrays requires only minor 
notational modifications. □ 

Proof of Theorem \7.3\ From Theorem 14.11 relation (|7.2p and Holder inequality 
we obtain that for any a; G R and r.v.'s Xi, . . . , Xn 

P{h{Xi, . . . ,X„) <a;)-F(/i(ei, . . .,^n)<x) 

(8.14) = EI{h{(i, . . . ,C„)<x)C/„(a, . . . , e™) 

<4'x,....xAPmi,---.^n)<x))]^' 

P{h{X,, . . .,X„)>x)~Pihi^,, . . .,U>x) 

(8.15) = EI{h{^,, C„) >x)C/„(Ci, ...,^n) 

<^x,,...,xjPmi,---,U>^))'^' 

The latter inequalities imply that for any x G R 

\P{h{X,,...,Xr,)<x)-P{h{^,,...,^^)<x)\ 



< (t>Xu...,x„ max 



(P(/i(ei, . . . , e„) < x))i/2, (P(/i(ei, . . . , Cn) > a;))i/2 



The proof is complete. □ 



Proof of Theorem By Theorem O] we have £;/(Xi, . . . , X„) = 

Ef{£,i, . . . , 6i) + -Et/„(^i, . . . , Cn)/(Ci. ■ • ■ ) Cn)- By Cauchy-Schwarz inequality and 
relation (|7.2p we get 



i?t/„ (a , . . . , 60/(6 , • . • , Cn) < {Eui (a , • . • , Cn)) {Ef'i^i , . . . , a)) 



1/2 



Therefore, (|7.7p holds. Sharpness of (|7.7p follows from the choice of independent 
Xi,. . . ,Xn- Similarly, from Holder inequality it follows that ii q > 1, 1/p+l/q — 1, 
then 

(8.16) Ef{X,, . . . , X„) < {E{1 + C/„(Ci, . . . , Cn))^)'/^(i?/(ei, ■ • ■ , ?«))')'/«. 

This imphes (|71II)) . If in estimate ([5?TB)) g > 2 and, therefore, p G (1, 2], by Theo- 
rem |4?1] Jensen inequality and relation (|7.2p we have 



i^(l + C/„(a,---,Cn))'' = E{l + UniX,,...,X,,))P-^ 
< {l + EUn{X,,...,X,,))P-^ 

= (l + </>x, xj"^'- 
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Therefore, (|7.8[) holds. Sharpness of (|7.8p and (|7.10[) follows from the choice of 
Xi = const (a.s.), i = 1, . . . ,n. According to Young's inequality (see [19, p. 512]), if 
p : [0, oo) "> [0, oo) is a non-decreasing right-continuous function satisfying p{0) = 
limt^o+p(t) = and p(oo) — limt^cc p{t) = oo, and q{t) ~ sup{u : p{u) < t} is a 
right-continuous inverse of p, then 

(8.17) St < (/)(s)-f V(t), 

where 0(t) — p{s)ds and V'(t) — J^q{s)ds. Using (|8.17p with p(t) = ln{l + t) and 
(fTTj) . we get that 

i?^7n(a,---,?n)/(6, ■•■,?») < i?(e^(«^-'«"^)-l-i?/(Cl,...,en) 

+ S(l + C/„(a, . . . ,^„)) log(l + [/„($!, . . .,?«)) 

= £;(e/(«--'«"))-i-i?/(ei,...,e„) 
+ (5x1,. ..,x„- 

This establishes (|7.9p . Sharpness of (|7.9p follows, e.g., from the choice of independent 
X^s and / EE 0. □ 

Proof of Theorem \7.5\ The theorem follows from inequalities (|7.7p - ()7.10p ap- 
plied to f{xi, . . . ,Xn) = I{h{xi, . . . ,Xn) > X). □ 
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